MemPype: a pipeline for the annotation of eukaryotic membrane proteins
Abstract
MemPype is a Python-based pipeline that combines previously published methods for predicting signal peptides (SPEP), glycophosphatidylinositol (GPI) anchors (PredGPI), and all-alpha membrane topology (ENSEMBLE) with a more recent method, MemLoci, which assigns eukaryotic membrane proteins to one of three localization classes: 'cell membrane', 'internal membranes', or 'organelle membranes'. MemLoci reaches an accuracy of 70% and a generalized correlation coefficient (GCC) of 0.50 on a rigorous homology-unbiased validation set, and it outperforms other predictors of subcellular localization. The annotation process relies on both inheritance through homology and computational prediction. For each submitted protein, the pipeline first retrieves, when available, up to 25 similar proteins (sequence identity ≥50% and alignment coverage ≥50% on both sequences). This helps to identify membrane-associated proteins and to assign detailed localization tags. Each protein is also screened for the presence of a GPI anchor [0.8% false positive rate (FPR)]; a positive GPI anchor prediction labels the sequence as exposed to the 'Cell surface'. At the same time, the sequence is analysed for the presence of a signal peptide and classified by MemLoci into one of the three localization classes. Finally, the putative all-alpha membrane topology of the sequence is predicted (FPR <1%). The web server is available at: http://mu2py.biocomp.unibo.it/mempype.
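The annotation flow described in the abstract can be sketched in Python as follows. This is only an illustrative outline under stated assumptions, not MemPype's actual code: the `Hit` record, the function names, the ranking of hits by identity, and the predictor callables passed to `annotate` are all hypothetical placeholders standing in for the alignment search, PredGPI, SPEP, MemLoci and ENSEMBLE. Only the thresholds (up to 25 hits, identity ≥50%, coverage ≥50% on both sequences) and the 'Cell surface' rule come from the text.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List, Optional

MAX_HOMOLOGS = 25     # "up to 25 similar proteins"
MIN_IDENTITY = 0.50   # sequence identity >= 50%
MIN_COVERAGE = 0.50   # alignment coverage >= 50% on both sequences


@dataclass
class Hit:
    """One alignment hit (hypothetical record, not MemPype's data model)."""
    accession: str
    identity: float           # fraction of identical aligned positions, 0..1
    query_coverage: float     # fraction of the query covered by the alignment
    subject_coverage: float   # fraction of the hit covered by the alignment
    localization: Optional[str] = None  # annotation to inherit, if known


def select_homologs(hits: List[Hit]) -> List[Hit]:
    """Keep hits that pass both thresholds, capped at MAX_HOMOLOGS."""
    passing = [
        h for h in hits
        if h.identity >= MIN_IDENTITY
        and h.query_coverage >= MIN_COVERAGE
        and h.subject_coverage >= MIN_COVERAGE
    ]
    # Assumption: the best hits are kept first, ranked by identity.
    passing.sort(key=lambda h: h.identity, reverse=True)
    return passing[:MAX_HOMOLOGS]


def annotate(
    sequence: str,
    hits: List[Hit],
    predict_gpi: Callable[[str], bool],
    predict_signal_peptide: Callable[[str], bool],
    predict_memloci: Callable[[str], str],
    predict_topology: Callable[[str], Any],
) -> Dict[str, Any]:
    """Combine homology-based tags with the four predictor outputs."""
    homologs = select_homologs(hits)
    gpi = predict_gpi(sequence)
    return {
        # Localization tags inherited from similar proteins.
        "homolog_localizations": sorted(
            {h.localization for h in homologs if h.localization}
        ),
        "gpi_anchor": gpi,
        # A positive GPI prediction labels the protein as 'Cell surface'.
        "cell_surface": gpi,
        "signal_peptide": predict_signal_peptide(sequence),
        "memloci_class": predict_memloci(sequence),
        "topology": predict_topology(sequence),
    }
```

Passing the predictors in as callables keeps the sketch independent of any particular tool interface; a real deployment would wrap whichever programs are actually installed.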
Similar resources
Eukaryotic Genome Annotation Pipeline
The NCBI Eukaryotic Genome Annotation Pipeline is an automated pipeline producing annotation of coding and non-coding genes, transcripts, and proteins on finished and unfinished public genome assemblies. It provides content for various NCBI resources including Nucleotide, Protein, BLAST, Gene, and the Map Viewer genome browser. The pipeline uses a modular framework for the execution of all ann...
A Novel Vector for Expression/Secretion of Properly Folded Eukaryotic Proteins: a Comparative Study on Cytoplasmic and Periplasmic Expression of Human Epidermal Growth Factor in E. coli
Expression of eukaryotic proteins in E. coli often results in their aggregation. Proper folding and solubility of therapeutic proteins are prerequisites for their bioactivity. This is not achieved in cytoplasmic expression in E. coli because of the absence of disulfide bond formation. A novel expression/secretion vector was constructed which exploited β-lactamase signal sequence to trans...
A survey of integral α-helical membrane proteins
Membrane proteins serve as cellular gatekeepers, regulators, and sensors. Prior studies have explored the functional breadth and evolution of proteins and families of particular interest, such as the diversity of transport-associated membrane protein families in prokaryotes and eukaryotes, the composition of integral membrane proteins, and family classification of all human G-protein coupled re...
TransportDB 2.0: a database for exploring membrane transporters in sequenced genomes from all domains of life
All cellular life contains an extensive array of membrane transport proteins. The vast majority of these transporters have not been experimentally characterized. We have developed a bioinformatic pipeline to identify and annotate complete sets of transporters in any sequenced genome. This pipeline is now fully automated enabling it to better keep pace with the accelerating rate of genome sequen...
Designing and Development of a DNA Vaccine Based On Structural Proteins of Hepatitis C Virus
Background: Hepatitis C virus (HCV) infection is one of the most prevalent infectious diseases responsible for high morbidity and mortality worldwide. Therefore, designing new and effective therapeutics is of great importance. The aim of the current study was to construct a DNA vaccine containing structural proteins of HCV and to evaluate its expression in a eukaryot...